Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 581 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 41.0 KiB |
| Average record size in memory | 72.2 B |
Variable types
| Numeric | 9 |
|---|---|
Alerts
| Pregnancies is highly correlated with Age | High correlation |
|---|---|
| Age is highly correlated with Pregnancies | High correlation |
| SkinThickness is highly correlated with Insulin and 1 other fields | High correlation |
| Insulin is highly correlated with SkinThickness | High correlation |
| BMI is highly correlated with SkinThickness | High correlation |
| level_0 has unique values | Unique |
| Age has 12 (2.1%) zeros | Zeros |
Reproduction
| Analysis started | 2022-09-20 06:40:04.989387 |
|---|---|
| Analysis finished | 2022-09-20 06:40:21.936652 |
| Duration | 16.95 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
level_0
Real number (ℝ)
| Distinct | 581 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0 |
| Minimum | -1.722624772 |
|---|---|
| Maximum | 1.73484803 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 290 |
| Negative (%) | 49.9% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -1.722624772 |
|---|---|
| 5-th percentile | -1.549523367 |
| Q1 | -0.8707836467 |
| median | 0.003833978942 |
| Q3 | 0.8465645036 |
| 95-th percentile | 1.570857225 |
| Maximum | 1.73484803 |
| Range | 3.457472801 |
| Interquartile range (IQR) | 1.71734815 |
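The Range and Interquartile range (IQR) rows follow directly from the quantiles listed with them. A minimal Python sketch using the values reported for this variable; the helper names are illustrative and are not part of pandas-profiling:

```python
# Quantiles copied from the quantile statistics of this variable.
minimum = -1.722624772
q1 = -0.8707836467
q3 = 0.8465645036
maximum = 1.73484803


def value_range(lowest: float, highest: float) -> float:
    """Range is the distance between the largest and smallest value."""
    return highest - lowest


def interquartile_range(first_quartile: float, third_quartile: float) -> float:
    """IQR is the width of the middle 50% of the data."""
    return third_quartile - first_quartile


print(round(value_range(minimum, maximum), 6))   # 3.457473
print(round(interquartile_range(q1, q3), 6))     # 1.717348
```

Both results agree with the reported rows up to the rounding of the printed quantiles.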
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | nan |
| Kurtosis | -1.190774234 |
| Mean | 0 |
| Median Absolute Deviation (MAD) | 0.8609517252 |
| Skewness | 0.01587279078 |
| Sum | 1.465494393 × 10⁻¹⁴ |
| Variance | 1.001724138 |
| Monotonicity | Strictly increasing |
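Every variable in this report has a standard deviation of 1.000861698 and a variance of 1.001724138 rather than exactly 1. One plausible explanation (an assumption; the report does not say how the data was prepared) is that the columns were standardized with the population standard deviation, which divides by n, while the profiler reports the sample standard deviation, which divides by n - 1. With n = 581 observations that predicts a variance of 581/580. The sketch below checks the arithmetic with the standard library; the toy data is made up.

```python
import math
import statistics

n = 581  # number of observations reported in the dataset statistics

# Sample variance and standard deviation predicted for a column that was
# scaled to unit *population* variance (assumed preprocessing step).
predicted_variance = n / (n - 1)
predicted_std = math.sqrt(predicted_variance)

# Empirical check on made-up data of the same length.
raw = [float(i % 17) for i in range(n)]
mu = statistics.fmean(raw)
sigma_population = statistics.pstdev(raw)        # divides by n
scaled = [(value - mu) / sigma_population for value in raw]
sample_std = statistics.stdev(scaled)            # divides by n - 1

print(round(predicted_variance, 9), round(predicted_std, 9), round(sample_std, 9))
```

If the assumption holds, the identical standard deviation across all columns is an artifact of the two conventions and not a property of the data.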
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -1.722624772 | 1 | 0.2% |
| 0.4183662911 | 1 | 0.2% |
| 0.550469995 | 1 | 0.2% |
| 0.5550252951 | 1 | 0.2% |
| 0.5595805952 | 1 | 0.2% |
| 0.5641358954 | 1 | 0.2% |
| 0.5686911955 | 1 | 0.2% |
| 0.5732464956 | 1 | 0.2% |
| 0.5823570959 | 1 | 0.2% |
| 0.586912396 | 1 | 0.2% |
| Other values (571) | 571 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -1.722624772 | 1 | |
| -1.718069472 | 1 | |
| -1.708958871 | 1 | |
| -1.699848271 | 1 | |
| -1.695292971 | 1 | |
| -1.690737671 | 1 | |
| -1.686182371 | 1 | |
| -1.68162707 | 1 | |
| -1.67707177 | 1 | |
| -1.66796117 | 1 |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.73484803 | 1 | |
| 1.73029273 | 1 | |
| 1.725737429 | 1 | |
| 1.721182129 | 1 | |
| 1.712071529 | 1 | |
| 1.707516229 | 1 | |
| 1.702960929 | 1 | |
| 1.693850328 | 1 | |
| 1.689295028 | 1 | |
| 1.684739728 | 1 |
Pregnancies
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -3.668895365 × 10⁻¹⁷ |
| Minimum | -1.136042381 |
|---|---|
| Maximum | 3.932020219 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 326 |
| Negative (%) | 56.1% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -1.136042381 |
|---|---|
| 5-th percentile | -1.136042381 |
| Q1 | -0.8379210515 |
| median | -0.2416783927 |
| Q3 | 0.6526855955 |
| 95-th percentile | 1.845170913 |
| Maximum | 3.932020219 |
| Range | 5.0680626 |
| Interquartile range (IQR) | 1.490606647 |
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | -2.727964682 × 10¹⁶ |
| Kurtosis | 0.3666248812 |
| Mean | -3.668895365 × 10⁻¹⁷ |
| Median Absolute Deviation (MAD) | 0.5962426588 |
| Skewness | 0.9692620925 |
| Sum | -2.131628207 × 10⁻¹⁴ |
| Variance | 1.001724138 |
| Monotonicity | Not monotonic |
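The coefficient of variation is the standard deviation divided by the mean. For standardized data the mean is zero up to floating-point noise, so the resulting values on the order of 10 to the power 16 carry no information. A short Python sketch using the values reported for this variable; returning NaN for an exactly zero mean is an assumption about how the profiler guards the division, suggested by the nan reported for the first variable in this report:

```python
import math


def coefficient_of_variation(std: float, mean: float) -> float:
    """CV = std / mean; NaN when the mean is exactly zero (assumed guard)."""
    if mean == 0:
        return math.nan
    return std / mean


std = 1.000861698          # reported standard deviation
mean = -3.668895365e-17    # reported mean, i.e. zero plus rounding noise

cv = coefficient_of_variation(std, mean)
print(f"{cv:.6e}")
print(coefficient_of_variation(std, 0.0))
```

The first printed value reproduces the reported CV; the second shows the guarded case.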
Histogram with fixed size bins (bins=17)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -0.8379210515 | 103 | |
| -0.5397997221 | 82 | |
| -1.136042381 | 81 | |
| -0.2416783927 | 60 | |
| 0.05644293672 | 52 | |
| 0.3545642661 | 43 | |
| 0.6526855955 | 38 | 6.5% |
| 0.9508069249 | 33 | 5.7% |
| 1.547049584 | 22 | 3.8% |
| 1.248928254 | 22 | 3.8% |
| Other values (7) | 45 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -1.136042381 | 81 | |
| -0.8379210515 | 103 | |
| -0.5397997221 | 82 | |
| -0.2416783927 | 60 | |
| 0.05644293672 | 52 | |
| 0.3545642661 | 43 | |
| 0.6526855955 | 38 | 6.5% |
| 0.9508069249 | 33 | 5.7% |
| 1.248928254 | 22 | 3.8% |
| 1.547049584 | 22 | 3.8% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.932020219 | 1 | 0.2% |
| 3.33577756 | 1 | 0.2% |
| 3.037656231 | 2 | 0.3% |
| 2.739534901 | 7 | 1.2% |
| 2.441413572 | 6 | 1.0% |
| 2.143292243 | 8 | 1.4% |
| 1.845170913 | 20 | |
| 1.547049584 | 22 | |
| 1.248928254 | 22 | |
| 0.9508069249 | 33 |
Glucose
Real number (ℝ)
| Distinct | 116 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -4.815425167 × 10⁻¹⁷ |
| Minimum | -2.304010909 |
|---|---|
| Maximum | 2.629880873 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 319 |
| Negative (%) | 54.9% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -2.304010909 |
|---|---|
| 5-th percentile | -1.40337987 |
| Q1 | -0.6985381864 |
| median | -0.1503279883 |
| Q3 | 0.554513695 |
| 95-th percentile | 1.964197061 |
| Maximum | 2.629880873 |
| Range | 4.933891783 |
| Interquartile range (IQR) | 1.253051881 |
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | -2.078449281 × 10¹⁶ |
| Kurtosis | -0.08084846299 |
| Mean | -4.815425167 × 10⁻¹⁷ |
| Median Absolute Deviation (MAD) | 0.6265259407 |
| Skewness | 0.5422891697 |
| Sum | -2.842170943 × 10⁻¹⁴ |
| Variance | 1.001724138 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -0.6202224438 | 17 | 2.9% |
| -0.5027488299 | 13 | 2.2% |
| -0.776853929 | 13 | 2.2% |
| 0.3978822098 | 13 | 2.2% |
| -0.5810645725 | 13 | 2.2% |
| -0.1503279883 | 13 | 2.2% |
| -0.2678016022 | 12 | 2.1% |
| -0.3461173447 | 12 | 2.1% |
| -0.111170117 | 11 | 1.9% |
| -0.2286437309 | 11 | 1.9% |
| Other values (106) | 453 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -2.304010909 | 1 | 0.2% |
| -2.264853038 | 1 | 0.2% |
| -2.108221553 | 1 | 0.2% |
| -2.069063682 | 1 | 0.2% |
| -1.951590068 | 1 | 0.2% |
| -1.873274325 | 1 | 0.2% |
| -1.834116454 | 2 | |
| -1.71664284 | 3 | |
| -1.677484969 | 1 | 0.2% |
| -1.638327097 | 3 |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.629880873 | 1 | 0.2% |
| 2.590723002 | 2 | 0.3% |
| 2.551565131 | 3 | |
| 2.51240726 | 5 | |
| 2.473249388 | 1 | 0.2% |
| 2.355775774 | 2 | 0.3% |
| 2.316617903 | 1 | 0.2% |
| 2.277460032 | 3 | |
| 2.199144289 | 2 | 0.3% |
| 2.159986418 | 1 | 0.2% |
BloodPressure
Real number (ℝ)
| Distinct | 35 |
|---|---|
| Distinct (%) | 6.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1.964387727 × 10⁻¹⁶ |
| Minimum | -2.584642613 |
|---|---|
| Maximum | 2.56416979 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 303 |
| Negative (%) | 52.2% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -2.584642613 |
|---|---|
| 5-th percentile | -1.631158835 |
| Q1 | -0.6776750564 |
| median | -0.1055847893 |
| Q3 | 0.6572022334 |
| 95-th percentile | 1.801382768 |
| Maximum | 2.56416979 |
| Range | 5.148812403 |
| Interquartile range (IQR) | 1.33487729 |
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | -5.095031312 × 10¹⁵ |
| Kurtosis | -0.2903835923 |
| Mean | -1.964387727 × 10⁻¹⁶ |
| Median Absolute Deviation (MAD) | 0.7627870227 |
| Skewness | 0.03972050431 |
| Sum | -1.172395514 × 10⁻¹³ |
| Variance | 1.001724138 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=35)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -0.1055847893 | 42 | 7.2% |
| 0.275808722 | 42 | 7.2% |
| 0.08511196636 | 37 | 6.4% |
| -0.6776750564 | 36 | 6.2% |
| -0.296281545 | 35 | 6.0% |
| 0.6572022334 | 34 | 5.9% |
| 0.8478989891 | 33 | 5.7% |
| -0.1908768929 | 32 | 5.5% |
| -1.059068568 | 32 | 5.5% |
| -0.868371812 | 26 | 4.5% |
| Other values (25) | 232 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -2.584642613 | 3 | 0.5% |
| -2.393945858 | 1 | 0.2% |
| -2.203249102 | 4 | 0.7% |
| -2.012552346 | 10 | 1.7% |
| -1.82185559 | 9 | 1.5% |
| -1.631158835 | 11 | 1.9% |
| -1.535810457 | 2 | 0.3% |
| -1.440462079 | 11 | 1.9% |
| -1.249765323 | 17 | |
| -1.059068568 | 32 |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.56416979 | 1 | 0.2% |
| 2.373473035 | 4 | 0.7% |
| 2.278124657 | 1 | 0.2% |
| 2.182776279 | 4 | 0.7% |
| 1.992079523 | 6 | 1.0% |
| 1.801382768 | 14 | |
| 1.610686012 | 17 | |
| 1.419989256 | 16 | |
| 1.324640878 | 4 | 0.7% |
| 1.2292925 | 15 |
SkinThickness
Real number (ℝ)
| Distinct | 39 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -8.866497133 × 10⁻¹⁷ |
| Minimum | -2.139027761 |
|---|---|
| Maximum | 2.638164328 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 356 |
| Negative (%) | 61.3% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -2.139027761 |
|---|---|
| 5-th percentile | -1.384734273 |
| Q1 | -0.562999614 |
| median | -0.562999614 |
| Q3 | 0.6267150276 |
| 95-th percentile | 1.88387084 |
| Maximum | 2.638164328 |
| Range | 4.777192089 |
| Interquartile range (IQR) | 1.189714642 |
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | -1.128812972 × 10¹⁶ |
| Kurtosis | -0.1786351292 |
| Mean | -8.866497133 × 10⁻¹⁷ |
| Median Absolute Deviation (MAD) | 0.444587915 |
| Skewness | 0.6884450763 |
| Sum | -2.842170943 × 10⁻¹⁴ |
| Variance | 1.001724138 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=39)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -0.562999614 | 195 | |
| 0.8781461902 | 26 | 4.5% |
| 0.6267150276 | 24 | 4.1% |
| -0.2532940414 | 18 | 3.1% |
| 0.2495682838 | 17 | 2.9% |
| -0.8818719478 | 16 | 2.8% |
| 0.7524306089 | 15 | 2.6% |
| -0.7561563665 | 15 | 2.6% |
| 0.375283865 | 15 | 2.6% |
| -0.0018628788 | 14 | 2.4% |
| Other values (29) | 226 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -2.139027761 | 2 | 0.3% |
| -1.887596598 | 4 | 0.7% |
| -1.761881017 | 6 | 1.0% |
| -1.636165435 | 6 | 1.0% |
| -1.510449854 | 10 | |
| -1.384734273 | 4 | 0.7% |
| -1.259018692 | 12 | |
| -1.13330311 | 5 | 0.9% |
| -1.007587529 | 12 | |
| -0.8818719478 | 16 |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.638164328 | 6 | |
| 2.512448747 | 3 | 0.5% |
| 2.386733166 | 2 | 0.3% |
| 2.261017584 | 3 | 0.5% |
| 2.135302003 | 5 | 0.9% |
| 2.009586422 | 10 | |
| 1.88387084 | 14 | |
| 1.758155259 | 11 | |
| 1.632439678 | 5 | 0.9% |
| 1.506724097 | 10 |
Insulin
Real number (ℝ)
| Distinct | 110 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.936591615 × 10⁻¹⁸ |
| Minimum | -2.319812987 |
|---|---|
| Maximum | 3.117432993 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 432 |
| Negative (%) | 74.4% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -2.319812987 |
|---|---|
| 5-th percentile | -1.275370256 |
| Q1 | -0.3292440173 |
| median | -0.3292440173 |
| Q3 | 0.04554260895 |
| 95-th percentile | 2.380179301 |
| Maximum | 3.117432993 |
| Range | 5.437245979 |
| Interquartile range (IQR) | 0.3747866262 |
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | 1.007248498 × 10¹⁷ |
| Kurtosis | 1.763993393 |
| Mean | 9.936591615 × 10⁻¹⁸ |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.296839799 |
| Sum | 1.776356839 × 10⁻¹⁴ |
| Variance | 1.001724138 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -0.3292440173 | 321 | |
| 0.4448883588 | 11 | 1.9% |
| 0.1069804166 | 7 | 1.2% |
| 1.212860955 | 7 | 1.2% |
| 0.9056719164 | 7 | 1.2% |
| 1.366455474 | 6 | 1.0% |
| 1.520049993 | 6 | 1.0% |
| 0.2912938397 | 6 | 1.0% |
| 2.748806147 | 5 | 0.9% |
| -0.7531488908 | 5 | 0.9% |
| Other values (100) | 200 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -2.319812987 | 1 | 0.2% |
| -2.289094083 | 1 | 0.2% |
| -2.227656275 | 2 | |
| -2.10478066 | 1 | 0.2% |
| -2.074061756 | 2 | |
| -1.889748333 | 1 | 0.2% |
| -1.797591621 | 1 | 0.2% |
| -1.674716006 | 3 | |
| -1.643997102 | 2 | |
| -1.613278198 | 1 | 0.2% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.117432993 | 2 | 0.3% |
| 3.086714089 | 1 | 0.2% |
| 3.055995185 | 4 | |
| 2.994557377 | 1 | 0.2% |
| 2.902400666 | 1 | 0.2% |
| 2.871681762 | 1 | 0.2% |
| 2.840962858 | 1 | 0.2% |
| 2.810243954 | 2 | 0.3% |
| 2.748806147 | 5 | |
| 2.687368339 | 1 | 0.2% |
BMI
Real number (ℝ)
| Distinct | 215 |
|---|---|
| Distinct (%) | 37.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.62563956 × 10⁻¹⁶ |
| Minimum | -2.161400024 |
|---|---|
| Maximum | 2.543884128 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 293 |
| Negative (%) | 50.4% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -2.161400024 |
|---|---|
| 5-th percentile | -1.552674452 |
| Q1 | -0.7465243696 |
| median | -0.02263450007 |
| Q3 | 0.618995157 |
| 95-th percentile | 1.869350386 |
| Maximum | 2.543884128 |
| Range | 4.705284152 |
| Interquartile range (IQR) | 1.365519527 |
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | 1.779107401 × 10¹⁵ |
| Kurtosis | -0.3175777543 |
| Mean | 5.62563956 × 10⁻¹⁶ |
| Median Absolute Deviation (MAD) | 0.6745337421 |
| Skewness | 0.2578962716 |
| Sum | 3.455014053 × 10⁻¹³ |
| Variance | 1.001724138 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.1747900098 | 10 | 1.7% |
| 0.04317366989 | 10 | 1.7% |
| 0.1077607898 | 9 | 1.5% |
| -0.02263450007 | 9 | 1.5% |
| 0.1089818398 | 9 | 1.5% |
| -0.08844267003 | 9 | 1.5% |
| -0.2694151374 | 8 | 1.4% |
| 0.3722145197 | 7 | 1.2% |
| 0.2405981798 | 7 | 1.2% |
| 0.3228583922 | 7 | 1.2% |
| Other values (205) | 496 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -2.161400024 | 3 | |
| -2.128495939 | 1 | 0.2% |
| -2.013331641 | 1 | 0.2% |
| -1.980427556 | 1 | 0.2% |
| -1.963975514 | 1 | 0.2% |
| -1.947523471 | 2 | |
| -1.931071429 | 1 | 0.2% |
| -1.881715301 | 1 | 0.2% |
| -1.865263259 | 1 | 0.2% |
| -1.848811216 | 1 | 0.2% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.543884128 | 1 | 0.2% |
| 2.527432086 | 1 | 0.2% |
| 2.461623916 | 1 | 0.2% |
| 2.445171873 | 1 | 0.2% |
| 2.428719831 | 2 | |
| 2.362911661 | 1 | 0.2% |
| 2.346459618 | 1 | 0.2% |
| 2.330007576 | 1 | 0.2% |
| 2.297103491 | 3 | |
| 2.280651448 | 1 | 0.2% |
DiabetesPedigreeFunction
Real number (ℝ)
| Distinct | 408 |
|---|---|
| Distinct (%) | 70.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.783720974 × 10⁻¹⁷ |
| Minimum | -1.410612949 |
|---|---|
| Maximum | 3.115961097 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 345 |
| Negative (%) | 59.4% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -1.410612949 |
|---|---|
| 5-th percentile | -1.167202835 |
| Q1 | -0.7401675481 |
| median | -0.304591555 |
| Q3 | 0.6348860772 |
| 95-th percentile | 1.988587938 |
| Maximum | 3.115961097 |
| Range | 4.526574046 |
| Interquartile range (IQR) | 1.375053625 |
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | 1.022986756 × 10¹⁶ |
| Kurtosis | 0.1335958208 |
| Mean | 9.783720974 × 10⁻¹⁷ |
| Median Absolute Deviation (MAD) | 0.5978494023 |
| Skewness | 0.9150115222 |
| Sum | 6.75015599 × 10⁻¹⁴ |
| Variance | 1.001724138 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -0.5992459032 | 5 | 0.9% |
| -0.6291383734 | 5 | 0.9% |
| -0.6590308435 | 5 | 0.9% |
| -0.641949432 | 5 | 0.9% |
| -0.6205976676 | 4 | 0.7% |
| -0.5309202573 | 4 | 0.7% |
| -0.9024409573 | 4 | 0.7% |
| -0.6334087262 | 4 | 0.7% |
| 1.211383715 | 4 | 0.7% |
| -0.6376790791 | 4 | 0.7% |
| Other values (398) | 537 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -1.410612949 | 1 | |
| -1.384990832 | 1 | |
| -1.380720479 | 2 | |
| -1.367909421 | 2 | |
| -1.363639068 | 1 | |
| -1.350828009 | 1 | |
| -1.333746598 | 1 | |
| -1.316665186 | 1 | |
| -1.312394833 | 1 | |
| -1.30812448 | 1 |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.115961097 | 1 | |
| 3.107420391 | 1 | |
| 3.068987215 | 1 | |
| 3.013472628 | 1 | |
| 2.95795804 | 1 | |
| 2.936606276 | 1 | |
| 2.932335923 | 1 | |
| 2.770062514 | 1 | |
| 2.620600163 | 1 | |
| 2.513841342 | 1 |
Age
Real number (ℝ)
| Distinct | 45 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.891860487 × 10⁻¹⁷ |
| Minimum | -1.03469052 |
|---|---|
| Maximum | 3.104071561 |
| Zeros | 12 |
| Zeros (%) | 2.1% |
| Negative | 353 |
| Negative (%) | 60.8% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | -1.03469052 |
|---|---|
| 5-th percentile | -1.03469052 |
| Q1 | -0.8465649712 |
| median | -0.3762510983 |
| Q3 | 0.6584394221 |
| 95-th percentile | 2.069381041 |
| Maximum | 3.104071561 |
| Range | 4.138762081 |
| Interquartile range (IQR) | 1.505004393 |
Descriptive statistics
| Standard deviation | 1.000861698 |
|---|---|
| Coefficient of variation (CV) | 2.045973511 × 10¹⁶ |
| Kurtosis | 0.3099974798 |
| Mean | 4.891860487 × 10⁻¹⁷ |
| Median Absolute Deviation (MAD) | 0.5643766475 |
| Skewness | 1.057519908 |
| Sum | 7.105427358 × 10⁻¹⁵ |
| Variance | 1.001724138 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=45)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -0.9406277458 | 63 | 10.8% |
| -1.03469052 | 54 | 9.3% |
| -0.7525021966 | 37 | 6.4% |
| -0.6584394221 | 34 | 5.9% |
| -0.8465649712 | 31 | 5.3% |
| -0.3762510983 | 29 | 5.0% |
| -0.5643766475 | 26 | 4.5% |
| -0.4703138729 | 26 | 4.5% |
| -0.2821883237 | 20 | 3.4% |
| -0.1881255492 | 17 | 2.9% |
| Other values (35) | 244 |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -1.03469052 | 54 | |
| -0.9406277458 | 63 | |
| -0.8465649712 | 31 | |
| -0.7525021966 | 37 | |
| -0.6584394221 | 34 | |
| -0.5643766475 | 26 | |
| -0.4703138729 | 26 | |
| -0.3762510983 | 29 | |
| -0.2821883237 | 20 | 3.4% |
| -0.1881255492 | 17 | 2.9% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.104071561 | 2 | |
| 3.010008787 | 1 | 0.2% |
| 2.915946012 | 3 | |
| 2.821883237 | 3 | |
| 2.727820463 | 2 | |
| 2.633757688 | 3 | |
| 2.539694914 | 1 | 0.2% |
| 2.445632139 | 3 | |
| 2.351569364 | 3 | |
| 2.25750659 | 2 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
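The calculation described for ρ can be sketched in plain Python: rank both variables, then apply the Pearson formula to the ranks. This is an illustration under simple assumptions (ties receive the average of their ranks, toy data), not the code the profiler runs; all function names are hypothetical.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def spearman(x, y):
    """Spearman's rho: Pearson correlation of the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [v ** 3 for v in x]  # monotonic but nonlinear relationship
print(spearman(x, y), pearson(x, y))
```

On this toy data ρ is 1 up to rounding because the relationship is perfectly monotonic, while the Pearson coefficient stays below 1 because it is not linear.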
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
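A small Python sketch of this formula, together with a check of the invariance under changes in location and scale. The data and the function name are made up for illustration; the scale factors are chosen positive, since a negative factor would flip the sign of r.

```python
import math


def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    return cov / (spread_x * spread_y)


x = [1.0, 2.0, 4.0, 7.0, 11.0]
y = [2.0, 1.0, 5.0, 6.0, 12.0]
r = pearson(x, y)

# Shift and rescale each variable separately.
x_shifted = [3.0 * v + 10.0 for v in x]
y_shifted = [0.5 * v - 4.0 for v in y]
r_transformed = pearson(x_shifted, y_shifted)

print(r, r_transformed)
```

The two printed values agree up to floating-point rounding, which is why standardizing the columns, as in this dataset, leaves r unchanged.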
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
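The pair-counting definition can be written out directly. The sketch below implements exactly the formula stated here (often called tau-a); statistical libraries commonly report a tie-corrected variant instead, so results on data with many ties may differ. The function name and data are illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs; ties count as neither."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move in the same direction
        elif direction < 0:
            discordant += 1      # the variables move in opposite directions
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]
print(kendall_tau_a(x, y))  # 8 concordant and 2 discordant pairs out of 10: 0.6
```

The loop visits every pair once, so the cost grows quadratically with the number of observations; that is fine for 581 rows.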
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available for it.
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.
First rows
| | level_0 | Pregnancies | Glucose | BloodPressure | SkinThickness | Insulin | BMI | DiabetesPedigreeFunction | Age |
|---|---|---|---|---|---|---|---|---|---|
| 0 | -1.722625 | 0.652686 | 1.298513 | 0.085112 | 1.255293 | -0.329244 | 0.372215 | 0.933811 | 1.693130 |
| 1 | -1.718069 | -0.837921 | -1.168433 | -0.486978 | 0.500999 | -0.329244 | -0.779428 | -0.244807 | -0.094063 |
| 2 | -1.708959 | -0.837921 | -1.011801 | -0.486978 | -0.253294 | 0.106980 | -0.532648 | -1.030552 | -1.034691 |
| 3 | -1.699848 | 0.354564 | 0.045461 | 0.275809 | -0.563000 | -0.329244 | -0.943949 | -0.885360 | -0.188126 |
| 4 | -1.695293 | -0.241678 | -1.442538 | -2.012552 | 0.878146 | -0.077333 | -0.055539 | -0.684653 | -0.564377 |
| 5 | -1.690738 | 1.845171 | 0.006303 | -0.190877 | -0.563000 | -0.329244 | 0.651899 | -1.171473 | -0.282188 |
| 6 | -1.686182 | 1.248928 | 0.397882 | 2.373473 | -0.563000 | -0.329244 | 0.107761 | -0.752979 | 2.069381 |
| 7 | -1.681627 | 0.056443 | -0.189486 | 1.992080 | -0.563000 | -0.329244 | 1.030296 | -0.928063 | -0.188126 |
| 8 | -1.677072 | 1.845171 | 2.081671 | 0.275809 | -0.563000 | -0.329244 | 1.096104 | 0.549479 | 0.188126 |
| 9 | -1.667961 | 0.354564 | 2.003355 | 0.085112 | -0.756156 | 2.595212 | -0.911045 | 0.762997 | 1.787193 |
Last rows
| | level_0 | Pregnancies | Glucose | BloodPressure | SkinThickness | Insulin | BMI | DiabetesPedigreeFunction | Age |
|---|---|---|---|---|---|---|---|---|---|
| 571 | 1.684740 | 0.950807 | 0.867777 | 1.801383 | 2.009586 | -0.329244 | 0.108982 | -0.073992 | 0.658439 |
| 572 | 1.689295 | -1.136042 | 0.319566 | 0.085112 | -0.563000 | -0.329244 | 0.816420 | -0.641949 | 1.881255 |
| 573 | 1.693850 | -0.837921 | -0.346117 | 0.466505 | -0.563000 | -0.329244 | 1.013844 | -0.902441 | -0.564377 |
| 574 | 1.702961 | -0.539800 | -1.050959 | -1.249765 | 0.123853 | -2.289094 | -0.483292 | 1.527390 | -0.940628 |
| 575 | 1.707516 | 1.547050 | 2.159986 | 0.275809 | 0.752431 | -0.329244 | 2.083227 | -0.022748 | 1.034691 |
| 576 | 1.712072 | 1.547050 | -1.011801 | -0.868372 | -0.563000 | -0.329244 | -1.453962 | -1.137310 | 0.094063 |
| 577 | 1.721182 | -0.539800 | 0.280409 | -0.105585 | 0.249568 | -0.329244 | 0.898680 | -0.291780 | -0.470314 |
| 578 | 1.725737 | 0.354564 | 0.241251 | 0.085112 | -0.253294 | 0.659921 | -0.845237 | -0.697464 | -0.188126 |
| 579 | 1.730293 | -0.837921 | 0.437040 | -1.059069 | -0.563000 | -0.329244 | -0.203607 | -0.253347 | 1.410942 |
| 580 | 1.734848 | -0.837921 | -0.855170 | -0.105585 | 0.752431 | -0.329244 | -0.154251 | -0.398539 | -0.846565 |